Augmenting Thesaurus Relationships: Possibilities for Retrieval
نویسندگان
چکیده
This paper discusses issues concerning the augmentation of thesaurus relationships, in light of new application possibilities for retrieval. We first discuss a case study that explored the retrieval potential of an augmented set of thesaurus relationships by specialising standard relationships into richer subtypes, in particular hierarchical geographical containment and the associative relationship. We then locate this work in a broader context by reviewing various attempts to build taxonomies of thesaurus relationships and conclude by discussing the feasibility of hierarchically augmenting the core set of thesaurus relationships, particularly the associative relationship. We discuss the possibility of enriching the specification and semantics of RT relationships, while maintaining compatibility with traditional thesauri via a limited hierarchical extension of the associative (and hierarchical) relationships. This would be facilitated by distinguishing the type of term from the (sub)type of relationship and explicitly specifying semantic categories for terms following a faceted approach. We first illustrate how hierarchical spatial relationships can be used to provide more flexible retrieval for queries incorporating place names in applications employing online gazetteers and geographical thesauri. We then employ a set of experimental scenarios to investigate key issues affecting use of the associative (RT) thesaurus relationships in semantic distance measures. Previous work has noted the potential of RTs in thesaurus search aids but also the problem of uncontrolled expansion of result sets. Results presented in this paper suggest a potential for taking account of the hierarchical context of an RT link and specialisations of the RT relationship.
منابع مشابه
بررسی مقایسهای روابط معنایی، ساختار شکلی و سیستم مدیریت اصطلاحنامههای فنی ـ مهندسی و نما
Purpose: Thesauri as important tools in storage and retrieval information systems have a significant role in the optimization of database search. So the publishing of thesauri needs to use standards as much as possible. I examined and compared two important thesauruses on the basis of ANSI/NISO z39.19 2005. Methodology: This study is an analytical and applied survey. The study population was t...
متن کاملLarge-Scale Linguistic Ontology as a Basis for Text Categorization of Legislative Documents
The paper describes the structure and properties of a large linguistic ontology – a new kind of information retrieval thesaurus Thesaurus on Sociopolitical Life for Conceptual Indexing. The thesaurus is used in various realscale information-retrieval applications in the legal domain. At present one of the main applications of the Thesaurus is knowledge-based text categorization. Categories are ...
متن کاملInformation Retrieval and the Thesaurus
Introduction This paper describes various developments of the retrieval system devised in Cambridge last year, which we described in a paper 'The Thesaurus Approach to Information Retrieval' (Amer. Doc. 1958). These are in part concerned with mechanically setting up the system, and in part with the interesting possibilities of achieving better retrieval by having more flexible search procedures...
متن کاملAugmenting Domain-Specific Thesauri with Knowledge from Wikipedia
We propose a new method for extending a domain-specific thesaurus with valuable information from Wikipedia. The main obstacle is to disambiguate thesaurus concepts to correct Wikipedia articles. Given the concept name, we first identify candidate mappings by analyzing article titles, their redirects and disambiguation pages. Then, for each candidate, we compute a link-based similarity score to ...
متن کاملTheSys - A comprehensive thesaurus system for intelligent document analysis and text retrieval
Well designed thesauri can represent seman-tic/conceptual knowledge so as to reveal relationships among diierent elements in documents, thus serving as a critical tool in intelligent text retrieval systems and document analysis systems. In this paper, we present a thesaurus system, referred to as TheSys, which can be used as a tool for users to build thesauri according to their own requirements...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- J. Digit. Inf.
دوره 1 شماره
صفحات -
تاریخ انتشار 2001